# Multi-dialect adaptation

Whisper Small Ta
Apache-2.0
A speech recognition model fine-tuned from OpenAI's Whisper Small on the Tamil Common Voice 17.0 dataset, with a Word Error Rate (WER) of 43.23%.
Speech Recognition Transformers Other
navin-kumar-j
38
1
Wav2vec2 Large Xlsr 53 Hungarian
Apache-2.0
An automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Hungarian Common Voice dataset.
Speech Recognition Transformers Other
sarpba
17
1
Whisper With Augmentation Small Arabic With Diacritics
Apache-2.0
This model is a fine-tuned version of openai/whisper-small on Arabic datasets with diacritics, supporting Arabic speech-to-text tasks with diacritical marks.
Speech Recognition Transformers
mohmdsh
14
1
Whisper Tamil Large V2
Apache-2.0
A Tamil speech recognition model fine-tuned from OpenAI Whisper-large-v2 on multiple public Tamil ASR corpora.
Speech Recognition Other
vasista22
325
7
Uzbek Stt
Apache-2.0
An Uzbek automatic speech recognition (ASR) model developed by the Oyqiz team, trained on the Common Voice 10.0 dataset
Speech Recognition Transformers Other
oyqiz
425
5
Whisper Small Pashto
Apache-2.0
A Pashto (ps) speech recognition model fine-tuned from OpenAI Whisper-small on the FLEURS dataset.
Speech Recognition Transformers Other
ihanif
18
1
Dansk Wav2vec21
Apache-2.0
This model is a Danish speech recognition model fine-tuned by Siyam/SKYLy on the common_voice dataset
Speech Recognition Transformers
Siyam
32
0
Wav2vec2 Common Voice Tr Demo
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Turkish Common Voice dataset.
Speech Recognition Transformers Other
YiTian
30
0
Sinai Voice Ar Stt
Apache-2.0
An Arabic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice Arabic dataset
Speech Recognition Transformers Arabic
bakrianoo
29
11
Wav2vec2 Large Xls R 300m Mongolian
Apache-2.0
An automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on Mongolian datasets.
Speech Recognition Transformers Other
infinitejoy
33
0
Wav2vec2 Large Xlsr 53 Spanish
Apache-2.0
A large-scale cross-lingual speech recognition model based on the Wav2Vec2 architecture, specifically optimized for Spanish, released by Facebook
Speech Recognition Spanish
facebook
66.63k
20
Xls R Kyrgiz Cv8
Apache-2.0
An automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice 8.0 Kyrgyz dataset.
Speech Recognition Transformers Other
lucio
16
0
Wav2vec2 Large Xlsr Hindi Commonvoice
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53 on the common_voice dataset, primarily used for Hindi speech recognition tasks.
Speech Recognition Transformers
nikhil6041
17
0
Wav2vec2 Large Xlsr Tamil Commonvoice
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Common Voice Tamil dataset.
Speech Recognition Transformers
nikhil6041
43
0
Wav2vec2 Xls R 300m Gn Cv8 3
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-300m on the Guarani (gn) Common Voice 8.0 dataset.
Speech Recognition Transformers Other
lgris
17
0
Xls R Uyghur Cv8
Apache-2.0
An automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice 8 Uyghur dataset.
Speech Recognition Transformers Other
lucio
24
9
Wav2vec2 Large Xlsr 53 Tatar
Apache-2.0
An automatic speech recognition model for Tatar fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting audio input sampled at 16 kHz.
Speech Recognition Other
crang
163
1
Wav2vec2 Speechdat
Apache-2.0
A Swedish automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the COMMON_VOICE - SV-SE dataset.
Speech Recognition Transformers
birgermoell
29
0
Wav2vec2 Large Xlsr As
Apache-2.0
An Assamese automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Common Voice dataset.
Speech Recognition Other
anuragshas
30
0
Wav2vec2 Urdu
Apache-2.0
An Urdu automatic speech recognition model based on the wav2vec2 architecture, fine-tuned on the Common Voice dataset.
Speech Recognition Transformers Other
kingabzpro
101
3
Wav2vec2 Xls R 300m W2V2 XLSR 300M YAKUT SMALL
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on a Yakut (Sakha) language dataset.
Speech Recognition Transformers Other
emre
90
0
Wav2vec2 Tr AG V1
A speech recognition model based on the Wav2Vec2 architecture, optimized for Turkish.
Speech Recognition Transformers
adresgezgini
20
0
Wav2vec2 Xlsr Dhivehi
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-1b on Dhivehi speech datasets.
Speech Recognition Transformers Other
sammy786
30
1
Wav2vec2 Xls R 300m Gn Cv8
Apache-2.0
An automatic speech recognition (ASR) model for Guarani (gn) fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice 8 dataset.
Speech Recognition Transformers Other
lgris
16
0
Xls R Ab Test
An automatic speech recognition (ASR) model based on the XLS-R architecture, fine-tuned on the COMMON_VOICE - AB dataset.
Speech Recognition Transformers Other
pablouribe
17
0
Wav2vec2 Xlsr Breton
Apache-2.0
This model is a fine-tuned automatic speech recognition model for Breton based on facebook/wav2vec2-xls-r-1b.
Speech Recognition Transformers Other
sammy786
13
0
Wav2vec2 Large Xlsr 53 Sah CV8
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Common Voice Yakut dataset.
Speech Recognition Transformers Other
emre
19
0
Wav2vec2 Large Xlsr 53 Es
Apache-2.0
A speech recognition model fine-tuned from Facebook's wav2vec2-large-xlsr-53 on the Spanish Common Voice dataset, with a test WER of 10.50%.
Speech Recognition Transformers Spanish
pcuenq
147
0
Xls R Ab Test
An automatic speech recognition model based on the XLS-R architecture, fine-tuned on the Common Voice Abkhaz (ab) dataset.
Speech Recognition Transformers Other
baaastien
17
0
Xls Npsc
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-300m on the NBAILAB/NPSC - 48K_MP3 dataset.
Speech Recognition Transformers
NbAiLab
32
0
Wav2vec2 Large Xls R 300m Ab V4
Apache-2.0
An automatic speech recognition model fine-tuned from Facebook's wav2vec2-xls-r-300m on an Abkhazian (ab) dataset.
Speech Recognition Transformers Other
Arxived
16
0
Wav2vec2 Large Xls R 300m Br D10
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on a Breton dataset, achieving a 52.3% Word Error Rate (WER) on the Common Voice 8 test set.
Speech Recognition Transformers Other
DrishtiSharma
21
0
Bert Base Arabic Camelbert Mix Pos Glf
Apache-2.0
A Gulf Arabic POS tagging model fine-tuned from CAMeLBERT-Mix on the Gumar dataset.
Sequence Labeling Transformers Arabic
CAMeL-Lab
22
1
Xls R 300m Ur Cv7
Apache-2.0
An Urdu automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-300m on the MOZILLA-FOUNDATION/COMMON_VOICE_7_0 - UR dataset.
Speech Recognition Transformers Other
HarrisDePerceptron
19
0
Wav2vec2 Xls R Urdu
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned from Facebook's Wav2Vec2-Large-XLSR-53 on the Urdu Common Voice dataset.
Speech Recognition Transformers Other
Maniac
22
1
Wav2vec2 Xls R 60 Urdu
Apache-2.0
An automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Common Voice Urdu dataset.
Speech Recognition Transformers Other
Maniac
16
1
Hausa Xlsr
Apache-2.0
This is a Hausa automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m, trained on the Common Voice 8.0 dataset.
Speech Recognition Transformers Other
Akashpb13
37
5
XLSR 1B Bokmaal Low
XLSR-1B-bokmaal-low is an automatic speech recognition (ASR) model focused on low-resource speech recognition tasks for Norwegian Bokmål.
Speech Recognition Transformers
NbAiLab
16
0
Bert Base Arabic Camelbert Mix Pos Msa
Apache-2.0
A Modern Standard Arabic POS tagging model fine-tuned from CAMeLBERT-Mix on the PATB dataset.
Sequence Labeling Transformers Arabic
CAMeL-Lab
9,341
1
Swahili Xlsr
Apache-2.0
A Swahili automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice 8 dataset.
Speech Recognition Transformers Other
Akashpb13
26
8